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© Secretion of Insultn-flke growth factor 1 1n E. coll. 

@ A method for bactertefly producing IGF-t Is disclosed In 
which Gram-negative bacteria are caused to express a gene 
consisting of a lamB or ompF signal sequence operatively 
joined to a DNA sequence encoding IGF-I and producing IGF -I 
which Is secreted into the pertplasmlc space of the bacteria. 
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Description 

SECRETION OF INSULIN-LIKE GROWTH FACTOR-I IN E. COLI 
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space as a mature protein. 

10 gSSlaig^g of ggsrcmiQF-al ha ve been Isolated trom various animal species and are believed to be 

tssssxsss^ ^ — rs and van de 

^,,^^cragM 

* '^IralsourcesoMGF-lt^^ 

and groups have turned to recombinant ^^^^^**^^^**^****^*^^^^^^^^^^^J^^JJJ^^ vwAnrtb^fl^aT 155.655 (published 
recombinant DNA methodologies can ^J™"?'^ 135.094 (published 

September 25. 1985) which describes production ol ^5j^ a ^" s ,on protein In £ coif. 130.166 
30 March 12. 1985) which describes a process tor produc ing IGF-1 a s a ~ S, ^ 0 P D , asm , c h^on protein In 
30 ^published January 2. 1985) which ^^J^S^^^t » SSSEnSn* methionine In £ 
kWococcusa^usand produc lor ^.^^^^T^s^or producing IGF-I In the yeast 

35 ^t:m^ 8nd Bue " et (,985) 

5£h describes Intracellular production of IGF-. as a mature «" d ^ a S S uch as £ coff has not yet 
A process for producing IGF-I as a secreted ^J^!S^£^^ « «« em P^ 
been disclosed. Such a process ^ derfrabte as ^ J ^"protect the mature IGF-I protein 

40 cytoplasmic production have required the use of fusion prot ™ ^^_=? Aoollcatlon publication number 
product from degradation by cytoplasmic ^J^J^^^^^^^^S^ an additional 
135.094 s</p ra andBue.letal1984).FustonproteJ «^^^^S^^ ^ J ^pU^ m«ll^ 
production step consisting of release of matu^ protein and/or can require 

by enzymatic or chemical means, which means can _ t "*^_™J^ tlon publication number 135.094 

4S modifications In IGF-I protein structure (see ^-9; European ^ E would afford a 
s«p«). Secretion of mature IGF-I Into the periplasm of s ™*°™^. g *"l^™ of tne nos , cells thereby 
clpLrtmentalizatlon of the IGF-I ^^J™^? d i £ES "SSEXSE periplasm* space 

when yeast host cells are employed. ( . - coft lnclude U.S. Patent No. 4,336,336 (filed 

Examples of various secretion systems described for ^use In *™'™™% 6g (pob |ished June 11. 1986). 

^(^^^^ 

which proteins are normally exported n ^^f m ^ B ^ S sequence s are commonly referred 

heterologous proteins to noncvtoplasml^ ,he cytoplasm to noncytoplasmlc 

to as "signal sequences" as they signal ^^° f £^ theimer membrane, periplasmic 

locations. In Gram-negative bacteria, such J^«V^^ ^™ during transport of proteins out of the 
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the desired noncytoplasmic location. Site-specific removal of the signal sequence, also referred to herein as 
accurate processing of Ihe signal sequence, is a prelerred event if the correct protein is lo be delivered lo Ihe 
desired noncytoplasmic location. 

Attempts to employ such previously described signal sequences and secretion systems for delivery of 
heterologous proteins Into the E. coli periplasmic space have, however, not always been successful, (see 
Kadonaga. J. et al.. 1984; Ohsue. K. et al., 1983). Such results Indicate that further studies are needed to clarify 
the secretion mechanism before secretion of heterologous proteins in such Gram-negative bacteria as E. coli 
can become a predictable phenomenon. The term "heterologous protein" Is herein understood to mean a 
protein at least a portion of which Is not normally encoded within the chromosomal DNA of a given host cell. 
Examples of heterologous proteins Include hybrid or fusion proteins comprising a bacterial portion and a 
eucaryotlc portion, eucaryotlc proteins being produced In procaryotlc hosts, and the like. 

Summary of the Invention 

The present Invention Is directed to methods and compositions for producing Insulin-like growth factor-l 
(IGF-I) in Gram-negative bacteria whereby the IGF-I so produced is secreted Into the periplasmic space of the 
bacteria. 

In one embodiment of the present Invention, the methods comprise causing expression of a gene in 
Gram-negative bacteria, wherein the gene comprises a DNA sequence encoding a major outer membrane 
protein signal sequence operatrvely Joined to a DNA sequence encoding IGF-I, so that the IGF-I produced 
within the bacteria Is secreted Into the periplasmic space thereof. It has been discovered that the lamB and 
ompF signal sequences provide for a site-specific release of the signal sequence from the IGF-I protein so that 
the IGF-I protein can be released Into and accumulate at relatively high levels in the periplasmic space of 
selected bacteria. 

Also provided are various synthetic DNA sequences and transformed bacteria useful In practicing the novel 
methods as provided herein. 

Brief Description of the Figures 

In the following diagrammatic representation, the directional arrows represent the 5' to 3' orientation of a 
DNA coding sequence(s) (e.g. direction of transcription of a DNA coding sequence), "amp" denotes an 
amplcillin resistance gene, "P" denotes a promoter sequence operable In E. coli. "R.B.S." denotes a rib o some 
binding site, "LAM" denotes a DNA sequence coding for a lamB signal sequence. "IGF-I/ denotes an IGF-I 
coding sequence comprising yeast preferred codons, "IGF-Ie" denotes a IGF-I coding sequence comprising E. 
coli preferred codons and "T" denotes a transcription termination sequence. Relevant restriction 
endonuclease sites are also shown. The DNA regions marked as described above are for diagrammatic 
purposes only and are not drawn to scale unless otherwise noted. 

Figure 1 depicts the construction of a synthetic double-stranded <ds) IGF-I DNA coding sequence. The 
four chemically synthesized oligonucleotide portions of the synthetic coding sequence are shown In 
capital letters and are numbered IGF-I (1) (2) (3) (4). Regions of overlap between the chemically 
synthesized oligomers are denoted by a solid line (-) between complementary nucleotides. The lower 
case tetters denote enzymaticalry generated oligonucleotides. The amino acid sequence of the IGF-I 
peptide encoded in the synthetic dsDNA sequence Is shown In an amino {NHzMerminal to carboxyl 
(COOH) -terminal orientation. 

Figure 2 depicts the construction of pMON2108. an IGF-I expression vector comprising a DNA 
sequence coding for a lamB signal sequence operatively joined to an IGF-I coding sequence consisting of 
E. co//preferred codons. 

Figure 3A&;8 depict the construction of pMON2123, an IGF-I expression vector comprising a DNA 
sequence coding for an ompF signal sequence operatively Joined to an IGF-I coding sequence consisting 
of €, Jcoff preferred codons. "ompr denotes the DNA coding sequence for the ompFslgnal sequence. 

Detailed Description of the Invention 

As Used herein, the symbols representing amino acids (i.e. Ala for alanine) and nucleotides (e.g. C.GATJ 
are those conventionally employed (Lehninger, 1976). 

The present invention provides a means for producing IGF-I in Gram-negative bacteria whereby the IGF-I so 
produced is secreted Into the periplasmic space of the bacteria. The discovery of such a means Is significant 
as It allows for the direct production of mature IGF-I free from any undesirable amino acids and allows for an 
accumulation of mature protein In a readily accessible site and at a site free from harmful cytoplasmic 
proteases. The term "mature" when used to characterize an IGF-t protein means a protein essentially free from 
any extraneous NH2- or carboxyt(COOH) -terminal amino acids. An example of such a protein Is one-having the 
primary amino acid sequence as shown in Figure 1. Furthermore, the processes and compositions of the 
present Invention provide for production of IGF-I at relatively high levels (e.g. 4 to 8<>A> of total cellular protein 
produced). 

The signal sequences which were discovered to effect high level production and secretion of accurately 
processed IGF-I are two signal sequences of major E. coli outer membrane proteins. E. coll possess a plurality 
of tnajor outer membrane proteins Including, but not limited to, lamB, ompF;ompA % ompC and phoE. These 
proteins are all synthesized In the cytoplasm and transported to their outer membrane location. As such, these 
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proteins all contain a signal sequence which Isprocessed either before or during transport of the mature outer 
membrane protein from the cytoplasm to Its noncyloplasmic location As described more fully hereinafter, the 
signal sequences ot the E. coli lamB and ompF proteins provide accurate processing of the signal sequence 
when operatlvely Joined to IGF-I peptides. The tamS and ompF signal sequences also provide for secretion and 
5 accumulation of mature IGF-I In the periplasmic space of the Gram-negative (e.g. E. coli) host cells and allow 
high level production of IGF-I by the transformed host. Based upon these findings, it Is believed that other E. 
coli major outer membrane protein signal sequences can effect the secretion and accumulation of mature 
IGF-I In the periplasmic space. 

It Is known that all Gram-negative bacteria possesses an outer membrane. It is also known that most 
w members of the family Enterobacteriaceae possess a maltose-lnducible outer membrane protein homologous 
to the lamB protein of E. coli {see Bloch and Desaymard, 1985. Incorporated herein by reference hereto). 

Thus. It Is expected that at least some of the lamB signal sequences of other Enterobacteriaceae, such as 
Shigella. Enterobacter, and Klebsiella can effect secretion of mature IGF-I Into the periplasmic space 
particularly when the host cell belongs to the same or sufficiently homologous genus and species from which 
15 the outer membrane protein signal sequence was derived. 

In one embodiment of the present invention employing an E. coli expression system described more fully 
hereinafter, the preferred lamB signal sequence comprised the following amino acid sequence: 

20 1 10 

NH 2 "Met-Met-Ile-Thr-Leu-Arg-Lys-Leu-Pro-Leu-Ala«Val- 



25 



35 



AO 



20 

Ala-Val-Ala-Ala-Gly-Val-Met-Ser-Ala-Gla-Ala-Met-Ala- 
C00H; 



30 and the preferred ompF signal sequence comprised the following amino acid sequence: 



1 10 
NH 2 -Met-Met-Lys-Arg^Asn-Ile-Leu-Ala-Val-Ile-Val-Pro- 

20 

Ala-Leu-Leu-Val-Ala-Gly-Thr-Ala-Asn-ALa-COOH. 



Significantly, several other E. coli non-outer membrane protein signal sequences reported to effectively 
secrete E. coli and/or heterologous proteins into the periplasmic space were found to be unable to secrete 
accurately processed mature IGF-I Into the periplasmic space and/or did so at levels significantly lower than 
the lamB or ompF systems. The failure of such other signal sequences to effect secretion of mature IGF-I Into 

45 the periplasmic space to a significant degree is In keeping with prior reports of Inabilities to predict the 
secretion efficacy of a given signal sequence for a given peptide. 

In order to effect secretion of mature IGF-I using the lamB or ompF signal sequences the signal sequences 
are operatlvely joined to the IGF-I peptide. For the lamB signal, such operative joining can be achieved by at 
least two distinct means. In one preferred means, the lamB signal sequence was directly joined (e.g. by 

50 peptide bonding) at Its carboxyi-terminus to the amlno-termlnus of the IGF-I peptide. The lamB signal 
sequence and IGF-I peptide are thus contiguous with one another. Alternatively, the lamB signal sequence Is 
operatlvely joined to the IGF-I peptide by creating a hybrid (heterologous) protein containing sequentially from 
the NH 2 -terrnlnus a lamB signal sequence, at least 39 to 43 of the NH 2 -terminal amino acids of the £1 coli lamB 
protein and an IGF-I peptide (see Silhayy, et al.. 1977; SHhavy et al., 1976). The DNA sequence of the E. coli 

55 tamB protein has been published by Clement and Hofnung (1981) which publication Is hereby incorporated 
herein by reference hereto. In a preferred embodiment, the NH 2 -terminal portion of the mature lamB protein 
comprised the following amino acid sequence: 

60 



65 
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NH 2 -Val-Asp-Phe-His-Gly-Tyr-Ala-Arg-Ser-Gly-Ile-Gly-Trp-Thr- 
Gly-Ser-Gly-Gly-Glu-Gln-Gln-Cys-Phc-Gln-Thr-Thr-Gly-Ala- 
Gln-Ser-Lys-Tyr-Arg-Leu-Gly-Asn-Glu-Cys-Glu-COOH. 



w 



15 



20 



25 



In both Instances, when employing the expression vectors and systems hereinafter described, mature IGF-I 
Is successfully secreted Into the periplasmlc space of the Gram-negative host cell and Is produced at high 
levels (e.g. 4-8% of total cellular protein produced). 

In one embodiment of the present Invention, operative joining of an ompF signal sequence to an IGF-I 
peptide Is achieved by similarly constructing (e.g. producing) heterologous proteins consisting of an ompF 
signal sequence at the NH 2 -terminus and an IGF-I peptide at the COOH-terminus so that the signal sequence 
and IGF-I peptide are contiguous with one another. These heterologous proteins successfully secreted 
mature IGF-I Into the periplasmlc space of the Gram -negative host cells. Furthermore, the IGF-I so secreted is 
accumulated at levels similar to those achieved using a lamB signal sequence. 

Having discovered that such major outer membrane protein signal sequences as the lamB and ompF signal 
sequences effect the secretion of mature IGF-I Into the periplasmlc space of such Gram-negative bacteria as 
E. coli, it is now possible to use conventional recombinant DNA techniques to produce the heterologous 
proteins described above (e.g. proteins consisting of a lamB or ompFsignal sequence operativety |oined to an 
IGF-I peptide). t a 

In one embodiment of the present Invention, production of heterologous proteins comprising a lamB or 
ompF signal sequence operativety joined to an IGF-I peptide Is achieved as follows. A synthetic gene Is 
constructed which contained the following components sequentially aligned in a 5' to 3' orientation: a 
promoter, a rlbosome binding site, a translation start/signal codon contiguous with a DN A sequence encoding 
the heterologous protein sought to be produced and a translation stop codon. Such a synthetic gene can be 
constructed by conventional recombinant DNA techniques (see Manlatis et al., 1982) and by employing the ^ 
methods and compositions set forth herein. The term "gene* refers to DNA sequences which provide for 
production of proteln(s) encoded therein. The term "synthetic* is understood to mean a chemically and/or 
enzymatically generated nucleic acid sequence or molecule free from naturally occurring intervening or 
contiguous sequences. Also embraced in the term "synthetic* are chemically and/or enzymatically generated 
molecules or sequences which are combined with nucleic acid sequences or molecules with which they are 
not naturally found associated or combined. Examples of synthetic genes, molecules or sequences include, 
but are not limited to, nucleic acid sequences enzymatically or chemically released from a larger (e.g. naturally 
occurring) molecule and genomic DNA which has been modified by nucleic acid addition, deletion and/or 
rearrangement. The term "genome* or "genomic* as used herein means the total DNA (e.g. chromosomal and 
extrachromosomal) contained within a given cell. 4Q 

In constructing a synthetic gene or genes which encode a heterologous protein comprising a lamB or ompF 
signal sequence operattvery Joined to an IGF-I peptide, any promoter capable of causing transcription in 
Gram-negative bacteria of such a gene or genes can be employed. Examples of such promoters are well 
known and available to those skilled In the art and include, without limitation, bacterial, viral and plasmld gene 
promoters (Rosenberg and Court. 1979). DNA molecules containing such promoter sequences can be ^ 
Isolated either by cloning the desired regions or. more conveniently, by chemically synthesizing them. A 
preferred promoter is the E. coll recA promoter, the sequence of which has been published (see Senear &; 
Rupp 197i9' and Sancar et al. 1980) and is furthermore contained on plasmld pMON2690 as a BamHI 
restriction fragment (see Figure 2). E. coffcarrying plasmld pMON2690 have been deposited with the American 
• Type Culture" Collection (Rockvtlle, Maryland) under accession number 67325. ^ 
Similarly, DNA sequences encoding the desired heterologous proteins which proteins can effect the 
secretion of IGF-I and preferably, mature IGF-I peptides Into the periplasmlc space, can also be chemically or 
enzymatically synthesized. The sequences of the lamB and ompF signal sequences have been published 
(Silhavy et al 1983; Nagahari et al., 1985) as have the sequences of the lamB and ompF proteins (Inokuchl, K. 
et al 1982, 1984; Silhavy. T. et al.. 1983). Thus, DNA sequences encoding these proteins or any desired ^ 
fragment thereof can be synthesized or Isolated from appropriate gene or cDNA libraries by conventional 
methods. Thereafter, the synthetic DNA sequence encoding the desired lamB or ompF signal sequence can 
be operatively joined, by such means as ligase, to a DNA sequence encoding IGF-I. 

The primary amino acid sequence of IGF-I has been published (Rlnderknecht and Humbel. 1978) and Is 
shown In Figure 1. From the primary amino acid sequence, a plurality of DNA coding sequences can be ^ 
constructed based upon the genetic code. Selection of the specific amino acid codons to be employed In 
generating an IGF-I DNA coding sequence is generally guided by such factors as determining a sequence 
which will optimize transcription, translation and. ultimately, the amount of protein produced. Selection of 
codons Is thus, for example, guided by a desire to avoid potential secondary structure constraints, a preferred 
use of codons by the host cell (Fiers et al.. 1976; Maruyama, T. et al.. 1986; Sharp, P. et al.. 1986; Bennetzen. J. ^ 
and HaB, B.. 1982; Gouy. M. and Gautler. C. 1982) and/or the need to provide for desired restriction 
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i»„.,o„» ciiae An example of an IGF-I DNA coding sequence consisting of yeast preferred 
endonuclease cleavage sites. An «J° construction is detailed in the examples below. 

%Z*^££SS& be Isolaled from an appropriate cDNA iibrary by conventional 

'"^previously indicated, the synthetic genes of the present Invention contained a J^^^ '"^J 
D r^o«c systems a ribosome binding site such as a Shine-Dalgarno sequence (Shine and Dalgarno. 1974 
K2en ^demonsTrated to be required for efficient translation of messenger RNA molecules. Recently, a novel 
£!^1*^^eorrttm sequences from the gene 10 of bacteriophage T7 has been shown to be 

patin-uw y •riol seouences " are described more fully In EP-A-0 241 446. 

The oreCed synthe^c genes of the present Invention contained G10L sequences operatlve.y joined to 
0N Tsequen7es 2* I fo^he heteroio^ous proteins of the present Invention (e.g. proteins comprising a 

bacterial cells carrying (e.g. transformed with) the vector^ exchanged for alternative 

comprising a tonS or o/npF signal sequence operatlvely ^"^.^ ' J/ ' Strain SE6004 described 

accurately processed liy the host cell and Is biologically active. ^ nTWW ^oTA (ethvlenediami- 

PeripJsm.c proteins can then be ~ te <f^^ RX* I^ ^ze-thaw 

netetraacetic acid) treatment (Oliver and Bedwtth. 1982). •J^SSff 1986) or by osmotic shock (Oka 
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The levels of protein (e g. »GF-I produced) can be determined by such conventional assays as Western 
i^Sottlno radioirnmunoassay protein staining of a protein gel and/or enzymatic activity. Examples of 
aTeystvanlwe £ Sanation of IGF-I levels and bloactivity include, without limitation, radioimmunoassay 
"opetend e7al.. 1980). radioreceptor assay (DErcole et al.. 1976) and cell growth assays (St Res it al 1979 
KoTs C E 1984). By such means. It was determined lhal the IGF I p.oduced In accordance with the methods 
of ,he Resent Invention was biologically active and represented from about 3 to KXVo of the total produced by 

'^rmS^ 

outer membrane Into the media. Indeed. In the secretion systems described here<n. about 3 o 50/o of the IGF-I 
protein^produced can be purified from the growth medium. The percentage ^V^^^£^ 
culture medium can be Increased. If desired, by employing such methods descnbed in U.S. Paten 4.595.658 to 
zSS»7et al. Briefly, cells able to secrete the desired protein (e.g. IGF-I) Into the |^<»Mn 
genetically altered by insertion of the fl bacteriophage gene III into their genomes. Production of he gene fll 
pSn by Gram-negative bacteria such as E. excuses perturbations in the outer membrane whlchjn turn 
aSows periplasm* proteins to leak out of the cell and Into the media. Retrieval of proteins from the med.a can 
have advantages In such situations, as when continuous culture conditions are being employed. 

triSlSl have been deposited with the American Type Culture Collection (ATCQ. 
12301 ptklawn Drive. Rockvilte. Maryland 20852. U.S.A. in accordance with the Budapest Treaty: 

ATCC 53588 E. coli SE6004 

.nvlnUon mVerogation of patent rights granted by governmental action. Furthermore, the present Invention Is 
notTobe Hmltedln scope by the microorganisms deposited since the deposited embodiments are intended 
only as specific illustrations of the invention. 

Examples 30 

^^^o^uSrw ere synthesized employing a 380A Apptied Biosys.ems .nc. DNA syntt^er In 
acJordanc^ with the procedure set forth by the ™^ c « u ™- A ^ 

California). Unless otherwise noted all specialty chemicals were obtained ^J^^^^^*^^ 
Restriction enzymes and DNA modifying enzymes were purchased from New England Bio tabs > ^eveny 
Massachusetts? New Engtand Nuctear (Boston. ^s^" 8 ^^ 

(Galthersburg. Maryland) and used In accordance wrth manufacturers Erections. T4 »NA ligase i was 
ourchased from Promega Biotec (Madison. Wisconsin) and used In accordance with nmAdurari 
speStons «P Tbeled nucleotides and .'"-.abe.ed protein A were purchased from Amersham (ArHng on 
ffSS.). £ coli strains SE2060 and SE6004 were obtained from Dr. T*^*^ J**^** 
Molecular Biology. Prlr.ce.on University. Princeton. New Jersey 08544 and «;e descnbed by Em ^ I.U 19811 
E coffstraln SE6004 carrying a priA4 mutation can be obtained from the ATCC 1*°****- ^™ d J " T ™ 
ATCC accession numbe/ 53588 £ coli strains W3110. 294 and JM101 can be obtained 'rom the ATCC 
S«» mX lunder ATCC accession numbers 27323. 31446 and 33876. respectively. A I baclerial 
£^tn t^Smnto and antibiotics were obtained from either Sigma (St. Louis. Missouri) or Drfco 
?S««ri« mTrMMaVnl PBR327 vectors can be obtained from Pharmacia (Piscataway. New Jersey). 
^!^^MlS^<^S^^ for selection of bacteria, cells carrying p.asmlds containing 
«n^nK re^teta^ce (amp') marker were as described in Mania.ls et al. (1982). When employed for protein 
!C^SSn B co/Kfre SSi taUurla Broth (LB) or M9 minimal medium (Manlatls et a... 1982) supplemented 
'J^^a^XZ^^on of k coll host cells with recombinant vectors was performed as 

r^rra^r iSon for heterologous protein analysis. A high level of aeration was maintained throughout the 

lri r^M^r to achieve maximal production of the desired gene product. 
9 ™p£sm£ protetns ^od^ctd in the E. coli host cells were obtained by an osmotic shock procedure 
„ c ^S« dSbedbvOka T etal (1985). Specifically, cells from 30 ml of culture were pelleted and the 
^ufenS a^^^^^^ vvas' cafled ,he"med.a fraction." The peOeted cells *er* then 

supemaiam coiib^wo y SUC rose and 30mM Tris pH 7.5. EDTA was then added to give a final EDTA 

[SSSrSSn o ?S£^££™£££tTZr^ incubated for 15 minutes at room temperature 

pe..e«ed. The resuiflng supema.ant was called the 'sucrose fraction. 
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The pelleted cells were resuspended In 3 ml of Ice water (about 0-4°C) and shaken gently lor 10 minutest an 
Ice bath and then pelleted. The resulting pellet and supernatant were called the "lysate* and 'water wash,' 
respectively. Typically, periplasmlc proteins obtained by an osmotic shock procedure are found In the water 
wash fraction. The periplasmlc IGF-I, however, was predominantly found In the sucrose fraction. 
5 IGF-I was purllled from the media as follows. Bacterial cells were removed by peHetlng or passing the media 
through a 0.22 u, membrane filter (Mlfflpore, Inc.; Bedford. Mass.) after which the media was concentrated 
using a YM-3 membrane (Amacon Filter Co.. Oanvers. Mass.) and the protein then precipitated with 60<¥o (w/v) 
ammonium sutfate. The precipitate was then pelleted, the pellet dissolved In 50 mM sodium formate (pH 4.2) 
and then desalted on a Sephadex G-25 column (Pharmacia. Piscataway. New Jersey) after which the eluate 
W was subjected to cation exchange chromatography on a mono S column (Pharmacia, Piscataway. New Jersey) 
from which the IGF-I was eluted with a 0-1M sodium chloride gradient. The protein eluted at about 400 to 
500mM Nacl. The IGF-I containing fractions from the mono S column were then subjected to reverse phase 
high performance liquid chromatography on a Brownlee C8 RP300 Aquapore column (Brownlee Laboratories: 
Santa Clara. Calff.) from which the IGF-I was eluted with an acetonltrile-trifluoroacetlc acid (TFA) gradient 
15 containing OA°/o (v/v) TFA. The IGF-I eluted at about 30<Vb (v/v) acetonltrlle. The recovered IGF-I was then 
stored at -20 to -80° C as a lyophitized powder. 

The levels of IGF-I produced by the bacterial host cells were determined by protein-specific assays such as 
protein staining (with Coomassie Brilliant Blue R250) of 15<Vo (w/v) sodium dodecyl sulfate (SDS)-poryacryla- 
mlde gels (Laemmli. 1970) or by Wes tern Immunoblottlng as described by Renart et al. (1979). Specifically. 
SO total protein produced within the host cell (cytoplasmic and periplasmlc), protein secreted Into the media and 
protein secreted only Into the periplasmlc space were determined as follows. Pelleted cells from 1 ml of culture 
were dissolved In SOS sample buffer as described by Laemmli (1970) to a final concentration of 1 Klett unit per 
uJ. This sample represented the total protein produced within the host cell. The media fraction, sucrose 
fraction, water wash and lysate (all described above) were Individually precipitated with trichloroacetic acid 
25 (TCA) and then redlssorved in SDS sample buffer to a final concentration of 1 Klett unlt/pl. All samples in SDS 
sample buffer were heated at 90° C to 100° C for 5 minutes prior to electrophoresis In 15<Wo (w/v) SOS 
poryacrytamlde gels (Laemmli. 1970). The Coomassie BrPliant Blue protein-stained gels were scanned by a 
Joyce Loeble chromoscan 3 scanning densitometer (Joyce Loeble. Inc., New Castle, Great Britain). 
IGF-I specific antibody was prepared In accordance with the method of Vaitukaitis. J. L. (1981). Briefly, 
30 female New Zealand white rabbits were Immunized intradermally with about 100 ug of IGF-I In Freund's 
Complete Adjuvant. The rabbits received Intradermal booster Injections of about 100 jig of IGF-I In Freund's 
Complete Adjuvant at 4 and 8 weeks. 

Amino acid sequence analysis of bacterially produced heterologous proteins was performed using an 
Applied Blosystems, Inc. (Foster City, California) Model 470A gas phase protein sequencer according to the 
35 methods described by Hunkapllleret al. (1983). The respective phenylthlodantoln (PTH)-amlno acid derivatives 
were Identified by reverse phase high performance liquid chromatography In an on-line fashion employing an 
Applied Blosystems. Inc. (Foster City. California) Model 120A PTH Analyzer fitted with a Brownlee 2.1 mm Inner 
diameter PTH-C 18 column. 

The bloactMty of IGF-I peptides isolated from the periplasmlc space of recombinant gram negative bacteria 
40 was established by the rat L6 myoblast proliferation assay described by Kotts. C.E. (1984) and U.S. patent 
application Serial No. 777,117 which is Incorporated herein by reference hereto. Specifically, IGF-I contained 
within the sucrose fraction following osmotic shock isolation was compared to a commercial preparation of 
human IGF-I obtained from Amgen, Inc. (Thousand Oaks. California. Lot 407) for demonstrable physiological 
activity In that assay. In order to minimize or eliminate any effects of sucrose In the IB myoblast proliferation 
45 assay, the sucrose fraction samples (e.g. 2ml of each sample) were first diafyzed versus one liter 44mM 
NaHCOs, pH 7.8, at 4°C. with three changes of diatysate over a three day period. The dialysis tubing had a 
molecular weight cut-off of 1000 kllodaltons. In an rat U6 myoblast proliferation assays conducted with the 
periplasmlc IGF-I of the present Invention, the IGF-I exhibited a statistically significant biological activity. 

Radioimmunoassay analysis of both the dlalyzed and undialyzed sucrose fraction was performed using a 
SO somatomedin C radioimmunoassay kit obtained from Nichols Institute (San Juan Capistrano, California) In 
accordance with manufacturer's Instructions. 

Example 1 

This example describes the construction and assembly of the synthetic double-stranded IGF-I coding 
55 sequence shown In Fig. 1. In this Example 1, the coding portion of the sequence predominantly comprises 
yeast preferred codons as determined from the codon usage pattern In the yeast 3-phosphogtycerate kinase 
gene (Genbank Data base; see BOofsky et al.. 1986). Less preferred codons were also employed at some sites 
in the sequence in order to create or remove restriction sites and/or to minimize the possibility of competing 
secondary structure within the Individual oligonucleotides. Oligonucleotides were synthesized as previously 
60 described and their respective concentrations determined using quantitative 5'-end labeling with y 32 P-ATP 
followed by separation of unincorporated y 32 P-ATP using ascending paper chromatography. 

Construction and assembly of the synthetic IGF-I coding sequence was conducted In accordance with the 
methods described by Scarpulla et al. (1982); Rossi et al. (1982); and Adams &; Galtuppl (1986). Specificlally. 
In order to produce the synthetic dsONA sequence shown In Figure 1. four partially overlapping synthetic 
€5 oligonucleotides, designated IGF-I (1), (2). (3) and (4) and denoted by the capital letters, were synthesized. 
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proteins In place of the « or ompF signal sequences were also constructed In accordance with the 
recombinant DNA techniques described herein. 

£X T PI ! ,e, .n« o EG004 SE2060 W3110 JM101 and/or 294 were Individually transformed (Manlatls et al.. 
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Claims 



1. A method for producing IGF-t which comprises: causing expression of a gene in Gram-negative 
bacteria, said gene comprising a first DNA sequence encoding a lamB or ompF signal sequence which 
first ONA sequence Is operatlvely Joined to a second DNA sequence encoding IGF-I. and producing IGF-I 
which is secreted Into the perlplasmlc space of the bacteria. 

2. The method of Claim 1 wherein the bacteria Is E. coll. 

3. The method of Claim 2 wherein the E coll Is an £. cot/ having the identifying characteristics of ATCC 
accession number 53588. 

4. The method of Claim 1 wherein the first DNA sequence is contiguous with the second DNA 
sequence. 

5. The method of Claim 1 or 4 wherein the signal sequence is an ompF signal sequence. 

6. The method of Claim 1 or 4 wherein the signal sequence is a lamB signal sequence. 

7. The method of Claim 1 wherein the gene Is carried on an extrachromosomal DNA molecule. 

8. The method of Claim 7 wherein the extrachromosomal DNA molecule Is a plasm Id DNA. 

9. The method of Claim 8 wherein the ptasmld DNA is selected from a group consisting of plasm id 
pMON2108. pMON2690 and pMON2123. 

10. The method of Claim 7 wherein the extrachromosomal DNA also carries a selectable marker. 

1 1 . The method of Claim 1 wherein the IGF-I Is also secreted Into the medium. 

12. The method of Claim 1 wherein the gene further comprises a G10L sequence. 

13. The method of Claim 1 wherein the gene further comprises a transcription termination sequence. 

14. A method for producing IGF-t which comprises: causing expression of a gene In Gram-negative 
bacteria, said gene comprising sequentially a DNA sequence encoding a promoter operable in 
Gram-negative bacteria, a G10L sequence, a DNA sequence encoding a lamB signal sequence, a DNA 
sequence encoding IGF-I and a transcription termination sequence, and producing IGF-I which Is 
secreted Into the periplasmic space of the Gram-negative bacteria. 

15. A method for producing IGF-I which comprises: causing expression of a gene In Gram-negative 
bacteria, said gene comprising sequentially a DNA sequence encoding a promoter operable in 
Gram-negative bacteria, a G10L sequence, a DNA sequence encoding an ompF signal sequence, a DNA 
sequence encoding IGF-I and a transcription termination sequence, and producing IGF-I which is 
secreted into the periplasmic space of the Gram-negative bacteria. 

16. The method of Claim 14 or 15 wherein the promoter is recA. 

17. The method of Claim 14 or 15 wherein the transcription termination sequence is a bacteriophage T7 
gene 10 terminator sequence. 

18. The method of Claim 14 or 15 wherein the IGF-I Is also secreted into the medium. 

19. A synthetic DNA molecule comprising a first DNA sequence encoding a signal sequence operatlvely 
Joined to a second DNA sequence encoding IGF-I. 

20. The DNA molecule of Claim 19 wherein the 3' end of the first DNA sequence Is contiguous with the 5' 
end of the second DNA sequence. 

21. A synthetic DNA molecule comprising a first DNA sequence encoding an ompF signal sequence 
operatively joined to a second DNA sequence encoding IGF-I. 

22. The DNA molecule of Claim 21 wherein the 3* end of the first DNA sequence is contiguous with the 5' 
end of the second DNA sequence. 

23. A synthetic DNA molecule comprising a gene expressable In Gram-negative bacteria, said gene 
comprising a first DNA sequence encoding a lamB oro/npFsignal sequence which first DNA sequence Is 
operative !y |oined to a second DNA sequence encoding IGF-I. 

24. The DNA molecule of Claim 23 wherein the gene comprises sequentially a recA promoter, a G10L 
sequence, a DNA sequence encoding a lamB signal sequence, a DNA sequence encoding IGF-I and a 
transcription termination sequence. 

25. The DNA molecule of Claim 23 wherein the gene comprises sequentially a recA promoter, a G10L 
sequence, a DNA sequence encoding a ompF signal sequence, a DNA sequence encoding IGF-I and a 
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transcription termination sequence. 

26. A transformed Gram-negative bacteria containing the DNA molecule of any of Claims 19. 20, 21. 22. 
23.24 or 25. 

27. A transformed E. co//contalnlng the DNA molecule of any of Claims 19. 20, 21 , 22. 23. 24 or 25. 

28. An E. co//havlng the Identifying characteristics of ATCC accession number 53588 and containing the 
synthetic DNA molecule of any of Claims 19. 20. 21 , 22, 23. 24 or 25. 

29. An E. coti containing a plasmld DNA molecule selected from the group consisting of pMON2108. 
pMON2690 and 2123. 

30. The E. cofiot Claim 29 having the Identifying characteristics of ATCC accession number 67325. 
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1. MIX: SYNTHETIC ompF CODING SEQUENCE 
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